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ABSTRACT 

The use of the computer for medical information 
processing was introduced about a decade ago- Considerable inroads 
have now been made toward its applications to problems in medicine - 
Present uses of the computer, both as a computational and 
noncomputational device include the following; automated search of 
patients* files; on-line clinical data processing; fetal heart rate 
and fetal electrocardiogram data acquisition; heart auscultation by 
computer; metabolism data acquisition; pattern recognition by 
computer; medical diagnosis using the logic of the propositiorial 
calculus; computer simulation of diagnostic problem solving; and 
computer-based medical instruction- (Author/ JY) 
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Any scientific field that deals with large amounts of data to be analyzed, 
coordinated and compiled at rapid rates invites the application of high speed 
data processing techniques. The area of medical information processing is just 
such a field. Accordingly, scientists have used electronic digital computer 
technology in this area for a number of years. The computer as an aid in med- 
ical information processing has been cast in two quite distinct but complimen- 
tary roles; as an instrument of computation and as a noncomputational device. 
Sometimes the machine assumes both roles and this will be apparent in the fol- 
lowing sections. 

Patient Files CComputational) . 

Hospital patient files are made up of a collection of many people’s records 
For any given patient, the record contains information about him for the dura- 
tion of his stay in the hospital. Public Health statisticians or administrators 
and investigators interested in conducting epidemiological studies are frequent- 
ly concerned with some aspect of the total population of patients. For example, 
questions involving the incidence of a given diagnosis (in a geographic region 
or within a hospital) , and the association of certain symptoms with particular 



*This research was supported in part by the National Institute of Mental 
Health, Grant Nos. MH 07722-07 and MH NS 19427-01. 
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diseases, can be answered only by referring to the population of patient records. 

The efficient search of these records has been made possi.ble by automating 
hospital record-keeping. Such automation requires a method of standardizing 
the amount and type of information that is notated on each patient's record. 
Standardization can be achieved by developing a uniform format in terms of 
which each physician can input patient data. Along these lines, one investi- 
gator (Baruch, 1965; also see Frey, 1969) has developed special computer pro- 
grams that secure their input information from the user by means of dialogue. 
These input programs are Socratic in that the computer elicits answers from 
the clinician,, who in turn may address the computer - either orally or with a 
minimiM of button -pushing, or by means of other learned responses. The input 
device may be located near the computer, or at a remote station. 

So that a substantial data base or dictionary may be compiled in computer 
memory, the cooperation of a large number of hospital personnel is required. 

Thus it is not sufficient just to have the physician read-in the system signs; 
it is also necessary to enlist the help of nurses, aides, and pharmacologists. 
Once such cooperation is attained and the files are compiled, any statistical 
analysis that can be performed by a clerk can be executed by the computer. 

Data analysis procedures are available in the form of extensive libraries of 
"canned" statistical programs. 

Statistical analyses of medical records could help in making a clinical 
diagnosis. The computer, for example, might be presented with the case of a 
patient exhibiting low grade fever, sore throat, and persistent headaches. 

By comparing the given symptoms, plus additional history, and laboratory test 
findings, with similar file cases, the computer might find that in 75-6 of the 
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-.cases in which this configuration of symptoms and signs occurred the diagnosis 
is a sure case of mononucleosis, the other cases being distributed in other 
diagnostic categories. Obviously the physician might arrive at the same 
diagnosis without the aid of a computer, but in cases in which the combina- 
tion of symptoms presents a complex diagnostic problem, it might be helpful 
to have immediate access to a large file of medical case histories. 

The computer is a valuable aid to the physician also because it can be 
programmed to take into account the relative frequency of kinds of various 
diseases and it can arrive at an estimated probability for each of tho pos- 
sibility of diseases that the patient might have. In practice, diagnosis is 
never a simple, single-step affair. Several stages are involved because each 
increment of new data about a patient modifies the existing diagnostic picture. 
The advantage of the computer for this form of sequential decision making lies 
in its facility for immediate information retrieval and rapid computational 
capabilities. The drawback of the computer for medical diagnosis, obviously, 
is that it cannot do a physical examination, read a medical chart or convert 
narrative information for its use. All information from these sources must 
be transcribed onto a checklist. 

On- line Clinical Data Processing (Computational) . 

. The speed with which a digital computer performs analyses enables these 
machines to process data while they are being collected. Therefore in addi- 
tion to searching files, as above, computers can be used to perform simul- 
taneous computations as the data are being input. Such computers are called 
on-line machines, as distinguished from those that are off-Uaie, where 
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input-output chores are performed by separate auxiliary equipment. In the 
early stages of computer technology, off-line machines were necessary because 
input and output mechanisms, being electro-mechanical, were much slower than 
the conpletely electronic storage, processing, and control units of the com- 
puter. But lately, computer processing has been synchronized with such 
auxiliary equipment, performing many operations between successive inputs. 

Thus, between successive inputs (inputting data is still relatively slow), 
the coii5)uter has time to perform a number of calculations: 

The main advantage of on-line computa' ion is that it teiids to provide 
the user with immediate feedback. A fast computer, for example, could cal- 
culate the auto- and cross-correlations^ of an electrocardiogram (ECG). or of 
an electroencephalogram (EEG) while the record is being taken, thus providing 
valuable diagnostic information. On-line computation of symptom and sign 
data, either with a direct patient "hookup" or by having the session on=^line , 

^The auto -correlation function, which is a relatively recent form of math- 
ematical computation, provides a test of periodicity in a time series record- 
ing such as the electroencephalogram. When a periodic signal is present, even 
though hidden to the eye, this function will identify its periodicity. Cross- 
correlations permit comparisons of wave or phase characteristics between pairs 
of EEG recordings. 

^Direct hookups require a special purpose machine called an analog com- 
puter. These machines are equipped to accept electrophysiological data (ECG 
and EEG) directly. 

^Facilitation of ongoing interactions between man and the on-line com- 
puter requires the ability to have the computer accept and execute several 
programs concurrently in "a time sharing" fashion. 
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permits the computer to provide immediate advice about possible further tests 

to be performed on the individual patients. 

This advantage of on-line clinical data acquisition can perhaps be best 
illustrated by referring to some work going on in electroencephalography 
(Adey, 1965; Brazier, 1961; 1965). It is in this area of investigation that 
considerable attention is being given to problems besetting scientists who 
wish to collect eledtrophysiological data and to receive almost simultaneous 

feedback in the form of the data's analysis. 

Electroencephalography has its origins in the observation by Caton, in 
1875, that spontaneous electrical activity can be obtained from the brain of 
animals . The relation of th5 s activity to cerebral function was demonstrated 
by the observation that sensory stimulation could alter the ongoing electrical 
findings. It was not until 1929 that a German psychiatrist, Hans Berger, 
demonstrated that the electrical activity of the brain could be recorded from 
man. The recording of such activity through the intact human skull came to 
be termed electroencephalography, and after considerable controversy, began 
to be used in clinical medicine. 

Clinical electroencephalography concerns itself mainly with interpreting 
the EeG for the purpose of aiding the diagnosis of central nervous system 
disorders. The clinical electroencephalographer examines the record of a 
patient's EEG for signs of irregularity in the continuously oscillating 
brain wave forms. This examination consists of a visual analysis and search 
for specific amplitude, frequency, and wave form variations that conform to 
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empirically agreed upon signs of pathology (see Figure 1) . 



Insert Figure 1 about here 



Experimental neurophysiology, in contrast to its clinical counterpart, 
is more concerned with evoking particular wave~£orms and manipulating the 
brain structures that under ly the pattern of this electrical activity. And 
it is here that most of the interesting computer analyses have been conducted. 
Unfortunately most wave-forms that are currently of greatest interest to the 
experimental neurophysiologist are of little value to the clinician. But the 
future of these computer analyses and its applications to EEC recording and 
interpreting holds considerable promise for the clinician. 

Typically the experimentalist records EEC data on special-purpose mag- 
netic tapes and these data are converted from analogue to digital computers 
or they are "digitized''.^ This procedure can occur off-line as well as on- 
line; but when it is off-line the lag between EEG recording and its analysis 
may be anywhere from several hours to many days. However, all the steps 
involved in EEG recording, digitization, and interpretation can be symchronized 
for on-line computation. The advantage of on-line computation, of course lies 
in the immediate feedback that is thus provided. One such on-line procedure 



^Voltage wave forms such as are obtained in EEG recording cannot be directly 
read into a digital con 5 )uter, since they do not contain digits. Also EEG in- 
puts occur as a function of time. Therefore, these data must be converted 
into digital form, with special attention to preserving the time function. 
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appra.isal. Such an appraisal may dictate that immediate delivery of the fetus 
is essential, or it may signify that conditions are normal. Stethoscopic 
sampling of the FHR, which is presently the most commonly used technique, 
does not provide a complete picture of these measurements, since it is im- 
possible to record these data continuously throughout labor and delivery. 
Moreover, with continuous recording techniques, it would be possible to cor- 
relate FHR and FECG patterns with various operative procedures, and with 
different phases of childbirth. But such automation, and the accompanying 
advantages, are far from being operational at the present time. Unfortunate- 
ly, current costs, limited computer programming sophistication, and hardware 
considerations still present problems that await solution prior to realizing 
such computer system. 

Heart Auscultation by Computer (Computational) . 

Physicians differ widely in their perception and interpretation of heart 
murmurs obtained by clinical auscultation. Heart auscultation is typically 
performed by placing a stethoscope on the left lateral position of the 
patient and listening to the precordium for valvular heart anomalies. Because 
the diagnosis of heart disease depends on the outcome of auscultation, a group 
of investigators (Taranta, Spagnuolo, Snyder, Gerbard, § Hofler, 1964) con- 
ducted a study in which they tested the accuracy of this method. They dem- 
onstrated that experienced physicians differentiate normal from abnormal 
heart conditions by the method of auscultation with success rates of 4 per- 
cent false positive and 37 percent false negative. 

These investigators then conducted an automated analysis of 
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phonocardiograms . The first step in this procedure was to tape record the 
heart sounds by means of a special-purpose audio-visual recorder. These 
tapes were next transcribed onto a continuous (analog) linear tape, and were 
converted for the digital computer • The digitized recording was then sub- 
mitted to numerous multivariate statistical analyses by computer. The overall 
success rate was significantly higher (7 percent false positive and false 
negative) than clinical and tape-recorded heart auscultations. 

The prospects for the future are good for providing thfe surgeon with con- 
tinuous heart auscultation data during his operative procedure. And these 
data, when considered together with other electrophysiological recordings, 
all being analyzed and interpreted almost simultaneously, should add to the 
clinician's diagnostic accuracy. The costs for such on-line systems are con- 
siderable, however, and large time-sharing computers to process these data 
are still not readily available for routine use. In the meantime, many com- 
putations can be performed off- line j and if the diagnostic success rate con- 
tinues to be consistently higher than that of clinical auscultation, then the 
computer is an important clinical adjunct even in its present form. 

Human Metabolism Data Acquisition by Computer (Computational) . 

The observation of recurring patterns in the metabolism of humans, par- 
ticularly as reflected in the urine composition, plays an important part in 

^The same physicians who performed the clinical auscultations listened to 
these recordings. Their success rate was about equal to that obtained after 
they had performed the clinical auscultations. 
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i diagnosing the state of the organism at any point in time. Metabolism patterns 

ate affected by many factors, particularly normal diurnal periodicities, 
normal schedules, and numerous other environmental changes, such as room 
temperature and position in bed. The problem of acquiring metabolism data 
from spinal-cord injured quadriplegics is currently being investigated in a 
multidisciplinary project at the Highland View Hospital in Cleveland (King 
§ Apple, 1966). Although, this project is oriented toward manipulating ex- 
perimental variables, the value of such data acquisition for clinical decision 

making should be apparent. 

The system is designed so that urine data are collected from a continuous- 
flow indwelling catheter; and bladder temperature, peripheral temperature, 
pulse, respiration, and muscle activity are measured by telemetry. Although 
the system is still not completely automated, such automatic data collection 

is clearly within view. 

The biochemical analysis is automated and consists of transporting the 
urine sample out of the patient's room to a large cold room where it is weighted 
and measured for the presence of sodium, potassium, creatinine, chlorine, and 
nitrogen. The data are presented in analog form and then digitized for com- 
puter analysis. Clearly the objective of a fully automated system will be to 
provide the physician with an arrangement whereby he can obtain almost im- 
mediate feedback.- From an instrumentation point of view, the major problems 
encountered in fully automating this system consist mainly of integrating it 
with the patient, while at the same time designing it to input data that 
I satisfy the requirements of the digital computer. When these human and 
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hardware problems are resolved, the physician will have access to one other 

tool to aid him in arriving at an accurate diagnosis.^ 

Pattern Recognition (Computational and Noncompu tational) . 

A pattern is a form which is characterized by a definite arrangement or 
interrelation of parts. The letters A, 3, and C, for example, are alphabetic 
patterns. ECG and EEC tracings are also patterns. There are two main as- 
pects in programming a computer to recognize patterns: the perceptual and the 

cognitive. The first of these - beyond which present-day computer technology 
has not made notable progress - consists of programming the computer to recog- 
nize that the same class name should be assigned to different manifestations 
of the same pattern or form. For example, small circles and large circles 
must be recognized as circles; and short As, tall As, fat As, and sloppy As 
must be recognized as an A. Perceptions of this kind are trivial for humans, 
once they have been taught to differentiate one pattern from another and to 
assign class names to various patterns. And an electroencephalographer has 
no more difficulty differentiating between, let us say, alpha and delta rhythms 
than does a child in discriminating between the letters A and B. 

The second, or cognitive, aspect of pattern-recognition programming is 
concerned with interpreting the identified forms. That is to say, many be- 
ginning medical students can learn to differentiate between an EEG alpha and 



^For reports on computer-based monitoring systems and a comprehensive 
patient monitoring concepts, the reader should see articles Warner, 1969 and 

Stacy, 1969. 
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delta rhythm and can be taught to recognize imusual wave- forms such as "spike" 
or "dome" discharges, but they may need many years of additional experience 
to interpret (or to learn to attach correct clinical significance) to these 
recognized patterns. Moreover, understanding the in-context meaning of 
particular waveform occurrences within an individual ' s EEG record requires 
a form of cognitive pattern recognition which is more complex than just per- 
ceptual identification. 

Generally speaking, even perceptual pattern recognition by computers has 
proved more difficult than was initially anticipated. This skill of the 
human to recognize patterns, upon close investigation, turns out to involve 
many conplex processes. One reason for the difficulty in programming the 
computer to recognize patterns lies in the difficulty the human has in expli- 
cating his ongoing process. A human who abstracts a pattern often is unaware 
of the basis of his classification. Asked to define the pattern he has 
recognized, the human may point to examples of similar patterns instead of 
explicating its characteristics. Thus, if pressed the human says he recognized 
the letter A, or a particular circle, because it resembles an A or a circle. 

In other words the human's pattern recognition is a consequence of experience 
or learning, and if asked to introspect about the process he says that he is 
matching the new pattern with one that he has in mind. 

The simplest automatic pattern-recognizing programs are based on a 
straight-forward matching notion (Green, 1963) . They compare the new input 
with a stored ideal version of each alternative, and the closest match compares 
the input with a stencil of each letter; the most closely fitting stencil in- 
dicates the choice. 
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But there are difficulties in this simple matching procedure. For while 
it seems reasonable to store perfect patterns and to retrieve and match these 
ideals with new inputs, the question of handling inputs of imperfect patterns 
or variations of the ideal patterns presents a problem. Occasionally this 
problem can be surmounted. Devices for reading printed numbers from checks 
get around the problem by creating artificial differences in size and shape 
between numerals to accentuate the differences. And plausibly, EGG or EEG 
tracings could be thus accentuated. However, the imperfections and variations 
are often so large that there is minimal correspondence between inputs and 
stored ideals. Generally, therefore, pattern recognition by computer has 
favored methods that analyze inputs. 

The strategy usually followed in analyzing inputs has been to select sev- 
eral attributes of the alternative patterns and to characterize the inputs, 
as well as the alternative patterns, in terms of these attributes. Thus an 
attribute of hand- lettered characters, for instance, might be number of 
curves. Each pattern has a particular value of each attribute [for example, 

R has one curve (and two straight lines)]. The combination of an attribute 
and its value is called a property, so that R has the property of one curve 
and two straight lines. In practice, to recognize a given pattern, the 
computer compares the observed properties of the input object with the stoied 
distributions of -values for that pattern. The machine then computes the 

d 

likelihood that the input object is an instance of stored pattern. There 
are two such programs that use the strategy of analyzing inputs, whicn are 
of potential use in recognizing patterns such as are obtained in medical 
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settings. These are MAUDE (Morse Automatic Decoder), described by Gold (1959), 
and Pandemonium , used and described by Doyle (1960, by Selfridge and Neisser 

(1963), and by Kleinmuntz (1969, 1970).. 

Many other, more complex pattern-recognition computer programs are being 
developed (see Fogel, Owens, § Walsh, 1966; Uhr, 1966; 1970); and applications 
of these prograrjs for medicine are apparent. For example, once the computer 
learns to recognize patterns of shadows and spots on x-rays, it could be 

g 

programmed to differentiate diseased from nondiseased chest x-rays. Like- 
wise, once the machine learns to recognize EEG patterns (for example, alpha 
rhythm, spikes, domes, petit mal variants) it can be taught to undertake the 
next, or interpretive, phase of EEG reading. And if the error rates can be 
minimized for both the perceptual and the cognitive performances of these 
programs, then perhaps computers can become valuable consultants in these 
areas. The rather unhurried pace of progress in pattern recognition, how- 
ever, suggests that these future developments are far from imminent. 

Logical Reasoning in Medical Diagnosis (Computational and Noncomputational) . 

Using a combination of symbolic logic (propositional calculus) and proba- 
bility theory, two investigators (Ledley § Lusted, 1959; Ledley, 1962) 
demonstrated that the computer could be an aid to the physician in arriving 

^Sterling and Perry (1965) recently described an interesting pattern-recognition 
program, which is designed to visualize radiation-dose distributions used during 
radiotherapy. Basically the program provides the therapist patterns of dose 
distributions resulting from the configurations of treatment conditions. 
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at a diagnosis and in choosing an optimum treatment plan. 

The use of logic for this purpose is closely related to the concept of 
symptom-disease complexes (SDC) . According to these workers, a symptom com- 
plex is a list of the symptoms that a patient does and does not possess; a 
disease complex is a similar list of diseases. An SDC is a list of both 
symptoms and diseases that a patient does and does not have. As an example 
of this, consider Figure 2 where two symptoms, and S 2 , and two diseases 
and D 2 are presented. Each column represents an SDC, where a unit in the 
row signifies that the patient has the corresponding symptom or disease and 
zero signifies that the patient does not have the symptom or disease. Thus 
the column in the rectangle of Figure 2 represents the SDC of the patient 

having S^^, not having S 2 , having D^^, and having D 2 . The columns of this 

' ^ 

figure represent all conceivable (by applying the truth tables of symbolic 
logic) SDCs that can be formed from two symptoms and two diseases. 

Insert Figure 2 about here 



Ledley (1962) uses the phrase "all conceivable SDCs," but then points 
out that not all of these are possible or actually occur. That is to say, 
"all conceivable SDCs" as computed by the truth table method does not neces- 
sarily correspond to what is possible according to medical knowledge. The 
effect of medical knowledge on the shape of the truth table matrix is to 
reduce the totality of all conceivable SDCs to those that are con^atible 
with the assertions embodied in the medical knowledge. Figure 3 represents 
all possible SDCs for the situation in which application of knowledge has 

IS 
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Figure 2: Logical basis for two symptoms, and and two diseases, 

and D2. 
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reduced the logical basis of possibilities. 



Insert Figure 3 about here 



The preceding description is somewhat abstract (and oversimplified) ^ 
and therefore it may be helpful to reify it with an illustration. Using the 
reduced logical basis of SDCs presented in Figure 3, suppose a particular 
patient presents the following symptom complex: He does not have but 

does have S 2 , written symbolically, this is 

ii ■ 

Where the "J' bar represents NOT and the dot represents AND. To make the 
diagnosis, consider Figure 3, which contains the reduced basis of all possible 
SDCs for those columns that include the symptom complex S^. There is one 
such column (rectangle) , and this informs the diagnostician that the patient 
with this symptom complex has disease complex Dj^D 2 - In other words, the 
diagnosis is that the patient has D^, but not D 2 « These truth table matrixes 
are stored in the computer, and the machine is programmed to search, detect, 
and print-out the diagnosis in the form of the logical possibilities. Al- 
though a computer is not essential for this storage and retrieval exercise, 
considerable time and effort are conserved by the use of the machine. 

Although such computer analysis may seem elegant at first glance, the 
fact of the matter is that SDCs are not all-or-none affairs. The problem of 
diagnosis really revolves about the following question: Given the patient's 

present symptom-complex, what is the probability that he has a particular 
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Figure 3: Reduced logical basis resulting from applying medical know- 

ledge to the truth table of Figure 2. The rectangle is the 
case of a patient having * S2 (see text) . 
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MdM tJul i. tot »»mm and is a job best left to computers. 

SiaMUn«» cunual 5if- CNoncomputational). 

yy tJUt th# WNi of < >mputers in medical settings had its pri- 

mmry Mfluol* o» gottlM • arriving at a diagnosis. 

Aft ofttlroly 44#fftfftftt rotoftfcll *' rategy has been followed by some investi- 
gftlofft (aftllftftft. m«ft4. 4 Ii*f and, 1965), who are more interested in study- 
tftg Thl 4lAjftO*liC 4ftCltlo^ process of physicians than they are in developing 
• C0ft|Ai4of pfftfifMi th^', accurately classifies diseases. Some of this research 
A * — .,oed in detail here. 

It was within the context of complex information processing by computers 
developed by Newell, ^ (1961; 1971) that we (Kleinmuntz, 1965; 1968; 

Wortman, 1965) undertook a number of studies involving the diagnostic problem 
solving of clinical neurologists. The two fundamental assumptions guiding 
these studies were 1) that clinical decision making is a special instance of 
problem solving and 2) that problem solving behavior can be studied by com- 
puters as a complex information processing phenomenon. 

The distinction between the complex information processing approach and 
numerous statistical decision theory strategies is that the former s goal is 
to parallel rather closely the consistencies, inconsistencies, errors, and 
shortcuts of the human diagnostician, whereas the statistical approaches are 
intended to produce a decision-making system that is an optimal or best 
possible system. Thus the main distinction is that the latter strategy 
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strives for optimality and the former for accuracy in reproducing thinking. 

The advantage of the complex information processing approach is that its 
emphasis is on simulating the human clinician and as such may yield informa- 
tion that may be useful for modeling him, which in turn could be used as a 

teaching device. 

To learn about neurologists' diagnostic search strategies, we devised 
a scheme which allowed the clinician to "think aloud". This was a method 
comparable to the one we used successfully in Q-sorting MMPI profiles 
(Kleinrauntz, 1963), but yet appropriate to the neurology problem. A variant 
of the childhood game of "Twenty Questions" fulfilled these requirements. 

Tne game is played by having one player, called the experimenter, think of a 
disease, while the other player, or subject, tries to diagnose the disease 
the experimenter has in mind. The experimenter can play any of a number of 
roles 1 he can pretend, for example, that he is a patient suffering from 
symptoms x, y, and z; or he could assume the role of the neurologist who is 
thinking of a particular disorder that is characterized by symptoms x, y , and 
z. The diagnostician’s job in either case is to inquire about the presence 
of certain symptoms, signs, or biographical data and he may, if he chooses, 
ask for certain laboratory test results. It is necessary that the experi- 
menter be an experienced neurologist so that he can answer the subject s 
questions. He must be able to recognize the appropriateness of many symptoms, 
signs, and laboratory tests that might possibly be relevant for a particular 

disease. 

These games are tape-recorded, and the end product, after appropriate 




editing, can be represented by a tree structure (see Figure 4) in which each 
point, or node, in the tree has exactly one connection to a point closer to 
the top or root of the tree. The starting point at the top, or the root, of 
the tree is the subject's first question. All subsequent questions are the 
tests that are performed at the various nodes of the tree. Each nodal point, 
then, represents the question asked by the subject or diagnostician j if the 
experimenter's reply is negative, then there is a branch to the left, if a 
positive reply is obtained, there is a branch to the right, dnless a diag- 
nosis has been reached each node is connected to exactly two lower nodes and 
through them to any number of still lower nodes. A path is a collection of 
lines from the root of the tree to a terminal node and is the schematic re- 
presentation of the search strategy used by the neurologist to arrive at a 
diagnosis. In other words, the tree structure is a representation of the 
diagnostician's solution path from a certain set of givens to the diagnosis. 

In order to illustrate this procedure, we present one such game in 
detail in Figure 4. The diagnostician was given the information that he is 
to diagnose the case of a 57-year old white female with aching and weakness 
in the lower extremities. 



Insert Figure 4 about here 



Inspection of this tree structure discloses that from the point at which 
the diagnostician asked the first question ("Was this gradual in onset?") 
until a diagnosis was reached, there were exactly eleven questions or test 
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nodes and ten binary branchings. Of the latter, seven were positive and the 
remainder were negative branchings. But more important than these descrip- 
tive features of the diagram is the fact that we have the diagnostician's 
solution path, which presumably is related to the way he might function when 
confronted with a patient exhibiting similar symptoms. 

In order to gain a further glimpse of this neurologist's reasoning 
process, he was instructed during a subsequent session, to state his reasons 
for asking each of his questions. Table 1 contains the same information that 
was presented in Figure 4 with the addition of the neurologist's stated 
reasons. Having the neurologist articulate his reasoning provided us with 
a rich "thinking aloud" protocol, while at the same time it allowed us to 
secure a test-retest reliability estimate of the size and structure of the 

tree structure. 



Insert Table 1 about here 



There are numerous promising possibilities open to us in the way these 
data could be best utilized. The most obvious of these uses is to collect 
a large number of protocols from one neurologist. These protocols would 
serve as a data-base, which could be stored in the computer, and, along with 
the clinician's heuristics, would represent one neurologist's method of 
diagnostic problem-solving. Ideally, the neurologist selected for such rep- 
resentation would be a highly proficient one. But one need not be so de- 
limited. Conceivably, proficient and non-proficient diagnosticians could be 
simulated and then their strategies compared. 

2-r? 
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Table 1 - Protocol of a Diagnostician’s Questions and Stated 
Reasons of a Disease Characterir.ed by Aching and 
Weakness in the Lower Extremities in a 57-year ^old 
White Female (See Figure 4) . 



Question and Answer at Test node 



Stated Reason 



Gradual in onset? (Yes) 



Any paraesthesia in her legs? (Yes) 

Any objective sensory signs? (Yes) 



Decreased vibration sense in her 



feet too? (Yes) 

Is her muscle weakness worse 

proximal ly? (Yes) 

Did the lady have normal spinal 

fluid? (No) 

High protein? (Yes) 

Between 100 and 200? (No) 

Under 100? (Yes) 

Is she diabetic? (Yes) 

Ok, the diagnosis is proximal 

diabetic neuropathy. (Yes) 



It is very important to narrow it 
down to one of two categories. That 
is between degenerate or tumor dis- 
orders and infections and vascular 
diseases. 

This is just to implls^te the sen- 
sory tracts . 

This includes decreased position 
sense which is a posterior column 
sign usually. 



Both carried in the same column, 
and I’m just reaffirming this. 

Different diseases give you dif- 
ferent proximal vs distal weakness. 

I want to see if there is increased 
protein and cells in the fluid and 
therefore if she has tumor vs in- 
flammatory disease. 

This is just one of the constituents - 
cells, protein and perhaps glucose. 

If it is markedly increased, it 
could be a tumor. 

Normal is up to 45. Therefore^ 
let’s think of diabetes. 




Individual differences among diagnosticians could also be ascertained 
by our laboratory procedures. That there are individual differences among 
diagnosticians in their ability to attain correct decisions is obvious. Not 
so obvious, however, are ways and means to measure those differences. Using 
the tree structure approach to diagnostic problem solving, we have been able 
to demonstrate graphically the existence; of individual differences among 
diagnosticians. A comparison of the trees yielded by experienced and inex- 
perienced neurologists, for example, disclosed that the latters' structures 
contain many more nodes, more naive questions and search strategies that 
yield considerably less information per question than those of experienced 
neurologists. 

Finally we can mention a third possible use to which our tree structure 
approach could be put. Based on the data collected from all proficient 
neurologists who participated in the diagnostic games, sufficient information 
has been collected to construct a medical model which resembles optimal 
diagnostic problem solving, using the data base of this model, as well as 
the heuristics for accessing this data base as inputs to a computer, we can 
utilize the computer as a tutor to students who will sit opposite it. Thus, 
upon command the machine will print out a set of givens (i.e., fever and 
headache in a twelve year old) , and the student can play the role of the 
diagnostician. Or the problem can be turned around. Rather than having the 
computer serve as a tutor and the subjects sitting opposite it, we can arrange 
the situation so that the computer attempts to achieve a diagnosis, and the 
expert neurologist will answer its questions. This is perhaps closer to the 



situation as it may ultimately exist where the machine acts as a backup 
diagnostician for the human clinician who may consult it for provisional 
diagnoses during his own work-up of a patient. In either case, whether 
the coii5)uter is tutor or diagnostician, the implications for its use in 
medical education, a topic which we will develop further below, is obvious. 
Computer- Based Medical Instruction CNoncomputa tional) . 

The idea of programmed and computer-based instruction has its roots in 
research with "teaching machines". Programmed .instruct ion is an educational 
technique in which a series of instructional materials or items is presented 
to the student, who must select a correct alternative or fill in a missing 
answer. The distinguishing feature from conVvJntional instruction is that 
the program is designed to provide the student immediate feedback from which 
he can determine whether his answer was right or wrong. The device by which 
programmed instruction is conducted is called the Teaching Machin e. The latter 
may range in complexity from textbooks with special page format (to allow feed- 
back) , to elaborate electromechanical machines in which the program may be 

under the control of a digital computer. 

Programmed instruction is among other things a response to the challenge 

of providing for individual differences among learners. A properly programmed 

tutoring device provides each student with the opportunity to work at his own 

pace, at the same time allowing him to make as few or as many errors as are 

necessary for him to learn the material completely. Moreover, such automated 

instruction relieves the human teacher of his routine, repetitive, and drill- 
* • 

master role, thus permitting him to concentrate more on the motivational. 
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social, and inspirational aspects of teaching. 

The application of data processing and computer technology to the function- 
ing of such programmed instruction is a relatively recent development (see Fiath, 
Andersen, d, Brainerd, 1959; Silberman § Coulson, 1962) when compared to the 
idea of an automatic tutor (see Pressey, 1926; Skinner, 1958). Tliis idea has 
only recently materialized in the installation of computer- linked typewriter 
input stations in classrooms and laboratories for faculty and student use. 

Among the more significant computer-based instructional programs currently 
being developed outside the medical field are the following (Goodlad, O’Toole, 

§ Tyler, 1966): a program that teaches students to count in the binary-number 
system and to add, subtract, multiply, and divide and, further, to convert 
from decimal numbers to binary ones and from binary to decimal numbers (Rath, 
et al 1959) ; a special program of feedback messages designed to provide com- 
puter programmers with knowledge of where they made errors in their programs 
(Perlis, 1959; 1963); an elementary school mathematics program (Suppes, 1965), 
and PLATO (Programmed Logic for Automatic Teaching Operations), a general- 
purpose teaching machine system. Since the PLATO system is most closely re- 
lated to computer-aided research currently being conducted in medical educa- 
tion, we will dwell in some detail on this program (Alpert § Bitzer, 1970, 
Bitzer § Easley, 1965) before discussing computer-aided medical instruction. 

At the Coordinated Science Laboratory of the engineering department at 
the University of Illinois, a group of scientists have been developing PLAIO. 

4 t 

The PLATO system utilizes a computer as the central control element for teach- 
ing a number of students simultaneously, while allowing each student to proceed 
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through the lesson material independently. The system provides for communica- 
r tion in two directions. Each student is provided with an electronic key set 

[; as a means for responding to the computer's queries and a television screen 

I for viewing information selected by the computer. There are two sources of 

I information which are usually displayed on the student's television screen: 

I ■ . 

t; 

I a bank of slides prestored in an electronic slide selector under the control 

I 

I ox the computer and a computer- control led storage tube that permits plotting 
I (at each of many student stations) diagrams, symbols, and words on the stu- 
I dent's storage tube. 

r PLATO has two systems of teaching methods, a "tutorial logic" and an 

I 

^ "inquiry logic." The first of these is design.ed to lead the student through 

I; a fixed sequence of topics and provides branching contingencies that depend on 

I 

the student's speed of learning. The system first presents facts and examples, 
[ as is illustrated in Figure 5, and then poses problems covering the material 

I presented. The student selects answers and, when he is ready, asks PLATO for 

I 

feedback. When he finds the problems too difficult he may choose to branch 

t 

I to easier material. The lesson material is organized in two sequences, the 

I • 




main sequence consisting of the minimum material that must be used by all the 

v/V 

students, and the "help" sequence that is designed for students who have dif- 
ficulty with quesitions in the main sequence. 

The student begins by viewing test material in the main sequence. When 
he has completed reading a page cf text, he pushes the button labeled CONTINUE 
(see Figure 5) and thus proceeds to the next page, or he can return to the 
preceding page of text by pushing the button labeled REVERSE. Generally, 
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students proceed through the lesson material by using the following logic 
buttons provided to them on a keyset: RENEW, ERASE, AHA, JUDGE, REVERSE, HELP 

AND CONTINUE. 

Questions are answered by having the student type the appropriate res- 
ponses with the use of buttons similarly labeled to those of a typewriter. 



Insert Figure 5 about here 



His answer appears on a cathode ray tube almost simultaneously. The 
student then may push the JUDGE button, and the computer determines the ac- 
ceptability of the answer by printing out OK or NO next to the answer. By 
using the ERASE button, the student 'an remove incorrect answers. He can 
push the HELP button if he has difficulty with the question, and this would 
take him into a help sequence that pertains to the question. 

The lesson materials are prepared for the tutorial logic by arranging 
them in a set of slides with at least one help slide for each question in the 
main sequence. Also a parameter tape is prepared, which contains the answers 
to the questions, their location on the slide page, and the order in which the 
slides are logically connected. Finally, error categories must be specified 
(for error detection) and a list made of monitored problems and their 
criteria for .^valuation. 

The inquiry- teaching logic system permits dialogues between the student 
and the computer. This system came about as a result of an interest in giving 
the student even more control of the way he learns than the tutorial system 
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A flow-chart of the tutorial teaching logic. See text for a key 
to the abbreviations. (Froir Bitzer, 1965). 
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permits. The inquiry system allows the student to ask questions of the com- 



puter. Typically, in a lesson by this system, gp’^eral problems are presented 
to the student. To solve them he must request and ^Tganize appropriate in- 
formation from the computer. The student may be asked to demonstrate his 
achievement by answering questions and he may also ask questions within a 
given range of possibilities in order to obtain information. 

The innovation of the inquiry logic is that this system provides the 
student a syntax by which he can ask questions about the lesson he is study- 
ing. Thus, for example, in the tutorial logic, the student communicates with 
the coii 5 )uter either with one of the control requests (that is, "Turn the 
page"; "Give me help"; "Judge my answer"), or composes short answers that 
usually must match one of the alternative stcred responses. If he types a 
question such as "What is a coefficient?" the computer might respond with 
a NO, since it treats his response as an answer. The inquiry logic system, 
on the other hand, provides the student with the syntax and hence with the 
opportunity to ask such questions. 

This syntax essentially requires that the student ask questions in a 
specific coded format that permits retrieval of information stored in the com- 
puter. One example given by the originators of the system (Bitzer § Easley, 
1965, p. 98) is as follows: The student in a teaching program for nurses 

who would like the answer to the question "What's the effect of administering 
nitro-glycerine on the heart rate of the patient?" must type the coded format 
for the following sequence of phrases: "Return patient to original state"; 

"Give drugs"; "Select nitroglycerine"; "Check condition of patient, vital signs. 
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pulse rate" (at this point the computer answers the question originally posed). 

Similarly several researchers at Bolt, Beranek, and Newman (Feurzeig, 
Munter, Swets, § Breen, 1964; Swets § Feurzeig, 1965) have developed a com- 
puter teaching system and applied it to a problem in teaching medical diagnosis. 
This system is designed to enable the medical student to augment his experience 
and his diagnostic skill. 

The Bolt, Beranek, and Newman computer program, called the "Socratic 
System" because of its inquiry format, states a problem (that is, a medical 
history and a request for a diagnosis) to a student and engages him in "con- 
versation" while he solves the problem. The conversation, of course, is accom- 

9 

plished through the use of an electric typewriter connected to, anci controlled 
by, the computer. The student types a question or an assertion and the machine 
types back a response - an answer, a comment, or another question. This conver- 
sation is executed by supplying the student with a list specifying the vocabu- 
lary for the problem. During the session, the student's questions and declara- 
tions must be selected from the terms on this list. 

The vocabulary can be extensive, and the student is allowed considerable 
freedom in his approach to solving the problem. He can specify the information 
he wants and can make assertions regarding the solutions. The system records 
the information given to the student and answers each of his questions by 

0 

For a discussion of some of the problems to be faced in designing a 
computer-based tutor to deal with written and spoken language, see 
Spolsky (1966). 
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typing one or more replies from a prespecified set of responses. At any 
point in time, the computer response depends on what has just been said and 
on everything that preceded it. Thus the system carries on a tutorial 
dialogue with the student, one in which interesting contingencies may be 
j developed. 

I Computer-aided teaching of medical diagnosis, which also has been tried 

I in at least two other studies (Entwisle § Entwisle, 1963; Kirsch, 1963), 

I 

I raises many interesting possibilities for introducing programmed instructional 

i 

t techniques into the medical school curriculum. The clinical experience of 

j 

I medical students is necessarily restricted by the variety of case material 

i: 

i available to them and their limited exposure to live patients. Thus, a stu- 

i 

I dent may not see any cases of a particular disease during his school years, 

I and some cases he will encounter in his later practice of medicine may differ 

I 

I significantly from the classical or textbook cases. The student’s "probability 

I 

? matrix", built up from his personal experience with a limited number of cases, 

may differ considerably from a matrix that is based on a large number of 
i cases. Practice with programs like the one previously described may permit 

students to enlarge their probability matrixes. 

These computer programs, then, are capable of generating a vast number 
I of hypothetical patients beckoning to be "examined" by inexperienced medical 

1 students. The limited usefulness of these programs must nevertheless be 

s recognized. The medical student is not able to view the "acutely prostrated" 

[ 

I: patient; he gains no e:jq)erience in obtaining heart auscultation; and general Jy, 

5 ; 

l' 

r' 
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he conceptualizes, rather than practices, medicine. Moreover, the artificial 
setting of the man-machine interaction does not drive home the lesson of the 
"real-life" consequences of performing an inappropriate procedure, nor does 
it confront the neophyte physician with the leality of the discomforts 
physical and economic - that accompany his selection of particular laboratory 
tests or exploratory procedures. But then again, these disadvantages must be 
weighed against the advantages of the added experience he gains that may 
otherwise be unavailable to him. 

Summary and Conclusion 

The use of the computer for medical information processing was introduced 
about a decade ago. Considerable inroads have now been made toward its ap- 
plications to the problems in medicine. The future of the computer as an 
information machine and decision maker in clinical areas is limited only 
by human ingenuity. There seems to be no lack of such ingenuity in developing 
the software and hardware that are essential for its usefulness to the clini- 
cian. How far or how quickly these developments will advance is difficult 
to predict; but some of promising present uses suggest that an interesting 

decade is ahead. 

Present uses of the computer, both as a computational and noncomputational 
device include the following: automated search of patients' files; on-line 
clinical data processing; fetal heart rate and fetal electrocardiogram data 
acquisition; heart auscultation by computer; metabolism data acquisition; 
pattern recognition by computer; medical diagnosis using the logic of the 
propositional calculus; computer simulation of diagnostic problem solving; 



and computer-based medical instruction. 
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